Major refactor, API changes, new tests #8

keevindoherty · 2024-08-04T20:21:49Z

Summary

This PR introduces a major project restructure, some API changes, and new tests.

At a high level, the new project structure looks something like this:

mac
├── LICENSE
├── benchmarks /* python scripts for benchmarking */
├── data /* sample datasets */
├── docs  /* doc files, compiled docs */
├── examples  /* example scripts using MAC, including for pose-graph SLAM */
├── mac  /* actual library code; this is what you 'pip install' */
│   ├── optimization
│   ├── solvers
│   └── utils
├── requirements.txt
└── tests. /* test code; mirrors the library structure */
    ├── optimization
    ├── solvers
    └── utils

I won't go into detail about the content that is consistent with what is on main, but in terms of the new files, we have:

benchmarks is for Python performance benchmarks. These are not meant to be unit tests, but instead report on solution quality and timing statistics for a few representative datasets.
mac/optimization is for the core optimization routines we use. This includes the Frank-Wolfe implementation in frankwolfe.py and implementation of linear program oracles for a variety of compact, convex constraint sets in constraints.py.
mac/utils is a directory containing what was formerly in the utils.py and cholesky_utils.py files. These are now split up into separate utility files conversions.py for NetworkX < > MAC format conversions, graphs.py for Laplacian constructors and general graph manipulation utils, rounding.py for rounding to the feasible sets implemented in constraints.py, and cholesky.py for Suitesparse Cholesky support. I also moved fiedler.py containing our custom hooks into the NetworkX TRACEMIN-Fiedler implementation here, but I might move these to a separate fiedler directory along with the specialized Cholesky utilities specific to Fiedler value computation.
mac/solvers contains implementations for three solvers: MAC, GreedyESP [Khosoussi et al. 2019] - a greedy algorithm for maximizing the (reduced) Laplacian determinant, and "GreedyEig" which is a greedy algorithm for optimizing algebraic connectivity.

Bugs squashed

In frankwolfe.py the relative duality gap convergence criterion sets a threshold of the form (dual_upper_bound - f(x)) / f(x) < relative_duality_gap_threshold. This will fail if f(x) happens to be zero, and will "false positive" if f(x) is negative. We fix this by moving the f(x) in the denominator to the right-hand side and taking the absolute value. So we now compute (upper - f(x)) < relative_duality_gap_threshold * f(x). This is more robust numerically, but of course if f(x) is indeed zero, it will never trigger. I think this is OK for now. The fix would likely be to add some absolute duality gap threshold, but this seems pretty hard to specify, since we do not know the "scale" of f(x).
In conversions.py, there were a variety of subtly bugs converting between NetworkX graphs and the MAC edge list format. These were mostly because the code was "overfit" to the specific case where the NetworkX graphs we were interested in were all unweighted (or had weights uniformly equal to one). This is fixed now.

Testing

New tests added to tests/... including:

Tests for the core optimization libraries, running Frank-Wolfe on some simple constrained concave maximization problems (e.g. quadratic objective). We now explicitly test cases like f(x) \approx 0 which would have caused a "divide by zero" error before (see above)
Tests for utils libraries. We now test for connected / disconnected graphs in our Fiedler value computations. The latter test (disconnected graph) is currently failing. We need to fix this to improve support for features onkjd/no-fixed-graph, but we actually have a test now! We also test our routines for converting between NetworkX and MAC, which actually had some subtle bugs (see above).

keevindoherty · 2024-10-24T00:43:31Z

I'm temporarily skipping the disconnected graph test for the Fiedler vector / value computation. I think fixing this should be relatively straightforward - just add a custom TRACEMIN solver to better handle this. A naive solution would be to add some regularization to the anchored Laplacian that gets built inside the solver, e.g. as M = L + reg * I, then compute λ₂(L) = λ₂(M) - reg along with a corresponding eigenvector.

If λ₂(L) = 0, then we just have to return any eigenvector in {v | v ∈ ker(L), v ⟂ 1}. (This could occur, for example, if the corresponding graph had more than 2 connected components).

There are interesting potential performance considerations here, I think. It might be better to attempt to solve for the Fiedler value and vector without this regularization, then catch the internal linear system solver exception and fall back to the regularized version? I'll leave this for a future PR, though.

The remaining TODOs as far as I can tell are:

Hook the new bits up with the pose graph stuff
Implement benchmarks here or make an issue to add in a future PR
Update READMEs

keevindoherty marked this pull request as draft August 4, 2024 20:21

keevindoherty added 25 commits August 10, 2024 14:32

Clean up tests.

dfdd6d3

Tidy up Petersen graph sparsification.

b0dbd03

WIP more tidying.

1872b6d

Reorganizing MAC and Fiedler utilities.

1c098f6

Cache setup.

debcfc8

Full project reorg.

9e8fc49

Test structure now mirrors project structure.

764de70

Update tests.

fb45183

Add future benchmark dir.

541e05a

Move split edges utility into pose graph utils.

128351a

Update tests and conversion code.

85e89dd

Fix typo.

1a81b82

Test main Laplacian builder function.

faf1ffc

WIP tidying Cholesky tests.

18831c5

Move Cache to inner class.

61ebc95

Note on future tests to add.

49d3a99

Add test for complete graph.

7e21f93

Add failing test for disconnected component case.

3b154e3

Fixes for optimization code.

e69b378

Remove unused import.

b2ab8ac

Start tests for optimization code.

7a2691a

Fix relative duality gap check, update constraints docs, and add test.

7b135db

MAC now working.

c75dcab

Some doc updates and minor cleanup.

0b1bcf4

Add __init__ to support test discovery.

587feef

keevindoherty force-pushed the kjd/refactor branch from bcb6bb9 to 587feef Compare August 10, 2024 18:32

keevindoherty added 3 commits August 14, 2024 19:36

Tidying unused names.

628be78

More cleanup.

46c27ab

Fix comment.

6613c4b

Skip test for unimplemented feature.

96f175e

keevindoherty added 7 commits November 2, 2024 13:26

Rm old benchmarks dir.

bf5e3b3

Fix bug in cache usage.

70c7bf4

Add new benchmarks.

49c1959

Add pytest-benchmark to deps.

576c66a

Clean up for Petersen graph example.

55be6ad

Everything should be functional now.

312ebaf

Rm imported GreedyESP depending on cholesky utils.

2977f0d

keevindoherty marked this pull request as ready for review November 8, 2024 02:21

keevindoherty merged commit 079663f into main Nov 8, 2024
1 check passed

keevindoherty deleted the kjd/refactor branch November 8, 2024 02:38

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Major refactor, API changes, new tests #8

Major refactor, API changes, new tests #8

keevindoherty commented Aug 4, 2024

keevindoherty commented Oct 24, 2024

Major refactor, API changes, new tests #8

Major refactor, API changes, new tests #8

Conversation

keevindoherty commented Aug 4, 2024

Summary

Bugs squashed

Testing

keevindoherty commented Oct 24, 2024